Identification of Divergent Functions in Homologous Proteins by Induction over Conserved Modules
Abstract
Homologous proteins do not necessarily exhibit identical biochemical function. Despite this fact, local or global sequence similarity is widely used as an indication of functional identity. Of the 1327 Enzyme Commission defined functional classes with more than one annotated example in the sequence databases, similarity scores alone are inadequate in 251 (19%) of the cases. We test the hypothesis that conserved domains, as defined in the ProDom database, can be used to discriminate between alternative functions for homologous proteins in these cases. Using machine learning methods, we were able to induce correct discriminators for more than half of these 251 challenging functional classes. These results show that the combination of modular representations of proteins with sequence similarity improves the ability to infer function from sequence over similarity scores alone.
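As a rough illustration of the idea of discriminating between functions by conserved modules, the sketch below represents each protein as a set of module identifiers and searches for a single module whose presence or absence separates two functional classes. This is a toy stand-in, not the induction method used in the paper: the abstract does not specify the learning algorithm, and the module identifiers (`PD0001`, ...), class labels, and function names here are all hypothetical.

```python
from typing import FrozenSet, List, Optional, Tuple

# Hypothetical toy representation: a protein is a set of conserved-module
# identifiers paired with a functional class label. All values are made up.
Example = Tuple[FrozenSet[str], str]
Rule = Tuple[str, str, str]  # (module, class_if_present, class_if_absent)


def induce_single_module_rule(examples: List[Example]) -> Optional[Rule]:
    """Return the first module (in sorted order) whose presence or absence
    separates exactly two classes without error, or None if there is none."""
    labels = sorted({label for _, label in examples})
    if len(labels) != 2:
        return None
    modules = sorted(set().union(*(mods for mods, _ in examples)))
    for module in modules:
        with_module = {label for mods, label in examples if module in mods}
        without_module = {label for mods, label in examples if module not in mods}
        # A usable discriminator: one class on each side, and they differ.
        if (len(with_module) == 1 and len(without_module) == 1
                and with_module != without_module):
            return module, next(iter(with_module)), next(iter(without_module))
    return None


def predict(rule: Rule, modules: FrozenSet[str]) -> str:
    """Apply an induced rule to the module set of an unlabeled protein."""
    module, present, absent = rule
    return present if module in modules else absent


# Four homologous toy proteins sharing PD0001 but differing in function.
examples: List[Example] = [
    (frozenset({"PD0001", "PD0002"}), "class_A"),
    (frozenset({"PD0001", "PD0002", "PD0004"}), "class_A"),
    (frozenset({"PD0001", "PD0003"}), "class_B"),
    (frozenset({"PD0001", "PD0003", "PD0004"}), "class_B"),
]

rule = induce_single_module_rule(examples)
print(rule)
```

In this toy data the module shared by all four proteins (`PD0001`) carries no information about function, which mirrors the situation the abstract describes: overall similarity does not separate the classes, while a module found in only one class does.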
Journal: Proceedings. International Conference on Intelligent Systems for Molecular Biology
Volume: 6
Publication date: 1998